🖥 PDF CRAFT-a python library for converting PDF (primarily scanned books) in Markdown and EPUB using local AI models and LLM to structure the contents Github
Basic possibilities
- extracting text and layout Uses the combination of Doclayout-Yolo and its own algorithms for detecting and filtering headlines, columns, footnotes and page numbers
- Local OCR Recognizes the text on the page via Onnxocr, supports acceleration on GPU (CUDA)
- Determining the order of reading With the help of LayoutReader, it builds a flow of text in the order in which it is perceived by a person
- Converting in Markdown Generates .MD with relative links to images (illustrations, tables, formulas) in the Assets folder
Installation and requirements Python ≥ 3.10 (recommended 3.10.16).
Pip Install PDF-Craft and PIP Install Onnxruntime == 1.21.0 (or Onnxruntime-GPU == 1.21.0 for CUDA).
For an EPUB conveier, you need access to the LLM service (for example, Deepseek).
🖥 PDF CRAFT-a python library for converting PDF (primarily scanned books) in Markdown and EPUB using local AI models and LLM to structure the contents Github
Basic possibilities
- extracting text and layout Uses the combination of Doclayout-Yolo and its own algorithms for detecting and filtering headlines, columns, footnotes and page numbers
- Local OCR Recognizes the text on the page via Onnxocr, supports acceleration on GPU (CUDA)
- Determining the order of reading With the help of LayoutReader, it builds a flow of text in the order in which it is perceived by a person
- Converting in Markdown Generates .MD with relative links to images (illustrations, tables, formulas) in the Assets folder
Installation and requirements Python ≥ 3.10 (recommended 3.10.16).
Pip Install PDF-Craft and PIP Install Onnxruntime == 1.21.0 (or Onnxruntime-GPU == 1.21.0 for CUDA).
For an EPUB conveier, you need access to the LLM service (for example, Deepseek).
The lead from Wall Street offers little clarity as the major averages opened lower on Friday and then bounced back and forth across the unchanged line, finally finishing mixed and little changed.The Dow added 33.18 points or 0.10 percent to finish at 34,798.00, while the NASDAQ eased 4.54 points or 0.03 percent to close at 15,047.70 and the S&P 500 rose 6.50 points or 0.15 percent to end at 4,455.48. For the week, the Dow rose 0.6 percent, the NASDAQ added 0.1 percent and the S&P gained 0.5 percent.The lackluster performance on Wall Street came on uncertainty about the outlook for the markets following recent volatility.
What Is Bitcoin?
Bitcoin is a decentralized digital currency that you can buy, sell and exchange directly, without an intermediary like a bank. Bitcoin’s creator, Satoshi Nakamoto, originally described the need for “an electronic payment system based on cryptographic proof instead of trust.” Each and every Bitcoin transaction that’s ever been made exists on a public ledger accessible to everyone, making transactions hard to reverse and difficult to fake. That’s by design: Core to their decentralized nature, Bitcoins aren’t backed by the government or any issuing institution, and there’s nothing to guarantee their value besides the proof baked in the heart of the system. “The reason why it’s worth money is simply because we, as people, decided it has value—same as gold,” says Anton Mozgovoy, co-founder & CEO of digital financial service company Holyheld.